1-way quantum finite automata: strengths, 
weaknesses and generalizations 



We study 1-way quantum finite automata (QFAs). First, we compare them with 
their classical counterparts. We show that, if an automaton is required to give the 
correct answer with a large probability (greater than 7/9), then any 1-way QFA can be 
simulated by a 1-way reversible automaton. However, quantum automata giving the 
correct answer with smaller probabilities are more powerful than reversible automata. 

Second, we show that 1-way QFAs can be very space-efficient. We construct a 1- 
way QFA that is exponentially smaller than any equivalent classical (even randomized) 
finite automaton. We think that this construction may be useful for design of other 
space-efficient quantum algorithms. 

Third, we consider several generalizations of 1-way QFAs. Here, our goal is to find 
a model which is more powerful than 1-way QFAs keeping the quantum part as simple 
as possible. 

1 Introduction 

It is quite possible that the first implementations of quantum computers will not be fully 
quantum mechanical. Instead, they may have two parts: a quantum part and a classical part 
with a communication between two parts. In this case, the quantum part will be considerably 
more expensive than the classical part. Therefore, it will be useful to make the quantum 
part as small as possible even if it leads to some (reasonable) increases in the size of the 
classical part. This motivates the study of systems with a small quantum mechanical part. 

Quantum finite automata (QFA) is a theoretical model for such systems. |12] introduced 
both 1-way and 2-way QFAs, with emphasis on 2-way automata because they are more 
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powerful. However, the model of 2-way QFAs is not quite consistent with the idea of a 
system with a small quantum mechanical part. |12| allows superpositions where different 
parts of superposition have the head of QFA at different locations. (Even more, using such 
superpositions was the main idea in the proof that 2-way QFAs are more powerful than 
classical finite automata.) This means that the position of the head must be encoded into 
quantum state. Hence, the number of quantum states necessary to implement a 2-way QFA 
is not a constant but grows when the size of the input increases. This also makes state 
transformations more complicated (and more difficult to implement). 

Hence, we think that more attention should be given to the study of simpler models like 
1-way QFAs. A 1-way quantum automaton is a very reasonable model of computation and 
it is easy to see how it can be implemented. The finite dimensional state-space of a QFA 
corresponds to a system with finitely many particles. Each letter has a corresponding unitary 
transformation on the state-space. A classical device can read symbols from the input and 
apply the corresponding transformations to the quantum mechanical part. 

Results about 1-way QFAs in [TJ] were quite pessimistic. It was shown that the class of 
languages recognized by 1-way QFAs is a proper subset of regular languages. We continue 
the investigation of 1-way QFAs and show that, despite being limited in some situations, 
they perform well in other situations. 

Our first results consider relations between 1-way QFAs and 1-way reversible automata. 
Clearly, a 1-way reversible automaton is a special case of a QFA and, therefore, cannot 
recognize all regular languages. It is a natural question whether 1-way QFAs are more 
powerful than 1-way reversible automata. Interestingly, the answer depends on the accepting 
probability of a QFA. If a QFA gives a correct answer with a large probability (greater than 
7/9), it can be replaced by a 1-way reversible automaton. However, this is not true for 0.68... 
and smaller probabilities. 

Then, we show that QFAs can be much more space-efficient than deterministic and even 
probabilistic finite automata. Namely, there is a 1-way QFA that can check whether the 
number of letters received from the input is divisible by a prime p with only O(logp) states 
(this is equivalent to log log p bits of memory). Any deterministic or probabilistic finite 
automaton needs p states (logp bits of memory). We think that this space-efficient quantum 
algorithm may be interesting for design of other quantum algoritms as well. 

Finally, we consider modifications of 2-way quantum automata where the head is always 
at the same position for all parts of superposition. Modified 2-way QFAs can be implemented 
with a quantum system of constant size. Several modifications are proposed. In one of our 
models (1-way QFAs with a probabilistic preprocessing), some non-regular languages can be 
recognized. 



2 Definitions 



2.1 Quantum finite automata 



We consider 1-way quantum finite automata (QFA) as defined in []12| |. Namely, a 1-way QFA 



is a tuple M = (Q, E, 5, go, Qacc, Qrej) where Q is a finite set of states, E is an input alphabet, 
5 is a transition function, q G Q is a starting state and Qacc C Q and Qrej C Q are sets of 
accepting and rejecting states. The states in Q aC c and Q re j are called halting states and the 
states in Qnon = Q — (Q aC c U Qrej) are called non-halting states, c 7 and $ are symbols that 
do not belong to E. We use c 7 and $ as the left and the right endmarker, respectively. The 
working alphabet of M is T = E U {c 7 , $}. 

A superposition of M is any element of h{Q) (the space of mappings from Q to C with 
Z2 norm). For q G Q, \q) denotes the unit vector with value 1 at q and elsewhere. All 
elements of h{Q) can be expressed as linear combinations of vectors \q). We will use if> to 
denote elements of h{Q)- 

The transition function 5 maps Q x T x Q to (C. The value a, 52 ) is the amplitude 
of I (72) in the superposition of states to which M goes from \qi) after reading a. For a G T, 
V a is a linear transformation on ^(Q) defined by 

V a {\qi))= E %i,a,g 2 )|g 2 ). (1) 
92 e<2 

We require all V a to be unitary. 

The computation of a QFA starts in the superposition [go) - Then transformations corre- 
sponding to the left endmarker c 7 , the letters of the input word x and the right endmarker $ 
are applied. The transformation corresponding to a G T consists of two steps. 

1. First, V a is applied. The new superposition if)' is V a (ip) where if) is the superposition 
before this step. 

2. Then, if)' is observed with respect to the observable E acc © E re j © E non where E acc = 
span{\q) : q G Q acc }, E rej = span{\q) : q G Q re j}, E non = span{\q) : q G Q non }- This 
observation gives x G Ei with the probability equal to the amplitude of the projection 
of if}'. After that, the superposition collapses to this projection. 

If we get ip' G E acc , the input is accepted. If ip' G E re j, the input is rejected. If 
if)' G E non , the next transformation is applied. 

We regard these two transformations as reading a letter a. 



Another definition of QFAs. Independently of [12|, quantum automata were intro- 



duced in [ 1 3 1 . There is one difference between these two definitions. In [12], a QFA is 



observed after reading each letter (after doing each V a ). In |Tj|, a QFA is observed only after 
all letters have been read. Any language recognized by a QFA according to the definition of 
1 13] is recognized by a QFA according to [JEJ. The converse is not true. Any finite language 



can be recognized in the sense of |12|| . However, no finite non-empty language can be recog- 



nized in the sense of ||13|| . Everywhere in this paper, we will use the more general definition 



of ||12|| . However, our results of section |4.1| which show that 1-way QFAs can be more space- 
efficient than deterministic or probabilistic automata are true in the more restricted model 
of [O] as well. 



2.2 Example 

To explain our notation, we give an example of a 1-way QFA. To keep it simple, we use a one 
letter alphabet £ = {a}. The state space is Q = {qo,qi,q aC c,Qrej} with the set of accepting 
states Q acc = {q aC c} and the set of rejecting states Q re j = {q re j}- The starting state is go- 

The transition function can be specified in two ways: by specifying 5 or by specifying V x 
for all letters x G T. These methods are equivalent: all V x are determined by 5 and equation 
(|I|). We shall define the automaton by describing V x . 

Va(\qo)) = gko) + ^\qi) + ^kej), 

VaQqi)) = \\qo) + - A^lqrej), 

V$(\q }) = \q re j}, V$(|gi)) = |g acc ). 

It can be also defined by describing 5. For example, V a (\q )) = §|go) + §|<7i) + r^krei) would 
be 

1 1 

5{q ,a,q ) = -,S(q ,a,qi) = -, 

S(qo, a, q acc ) = 0,5(q , a, q rej ) = -^=. 

As we see, this is much longer. For this reason, we will mainly use V x notation. 

There are some transitions that we have not described. For example, V a (q aC c) has not 
been specified. These values are not important and can be arbitrary. We need them to be 
such that V a is unitary but this is not difficult. As long as all specified V a (qi) are orthogonal, 
the remaining values can be assigned so that the whole V a is unitary. In the sequel, we will 
often shorten descriptions of QFAs by leaving out transitions that can be defined arbitrarily. 

Next, we show how this automaton works on the word aa. 

1. The automaton starts in \q ). Then, V a is applied, giving ||go) + + "^jkrej)- This 

is observed. Two outcomes are possible. With probability (l/-\/2) 2 = 1/2, a rejecting 
state is observed. Then, the superposition collapses to \q re j), the word is rejected and 
the computation terminates. Otherwise (with probability 1/2), a non-halting state is 
observed and the superposition collapses to ll^o) + ||<?i)- 111 this case, the computation 
continues. 



2. A simple computation shows that §|<Zo) + f |<Zi) is mapped to itself by V a . After that, 
a non-halting state is observed. (There are no accepting or rejecting states in this 
superposition.) 

3. Then, the word ends and the transformation V$ corresponding to the right endmarker 
$ is done. It maps the superposition to \\q r ej) + \\qacc)- This is observed. With 
probability (1/2) 2 = 1/4, the rejecting state q re j is observed. With probability 1/4, 
the accepting state q acc is observed. 

The total probability of accepting is 1/4, the probability of rejecting is 1/2 + 1/4 = 3/4. 
2.3 Reversible automata 

A 1-way reversible finite automaton (RFA) is a QFA with S(qi, a, q 2 ) G {0, 1} for all qi, a, q 2 . 
Alternatively, RFA can be defined as a deterministic automaton where, for any q 2 , a, there 
is at most one state q\ such that reading a in q\ leads to q 2 . We use the same definitions 
of acceptance and rejection. States are partitioned into accepting, rejecting and non-halting 
states and a word is accepted (rejected) whenever the RFA enters an accepting (rejecting) 
state. After that, the computation is terminated. Similarly to quantum case, endmarkers 
are added to the input word. The starting state is one, accepting (rejecting) states can be 
multiple. This makes our model different from both || (where only one accepting state was 
allowed) and |14| (where multiple starting states with a non-deterministic choice between 
them at the beginning were allowed). We define our model so because we want it to be as 
close to our model of QFAs as possible. 

Generally, it's hard to introduce probabilism into finite automata without losing re- 
versibility. However, there are some types of probabilistic choices that are consistent with 
reversibility. For example, we can choose the starting state probabilistically. The next 
example shows that such probabilistic choices increase the power of an automaton. 

Example. Consider the language L = {a 2n+z \n G IN}. It cannot be recognized by a 1- 
way RFA. However, there are 3 1-way RFAs such that each word in the language is accepted 
by 2 of them and each word not in the language is rejected by 2 out of 3. Hence, if we 
choose one of these three automata equiprobably, L will be recognized with the probability 
of correct answer 2/3. 

Probabilistic choices of this type can be easily done in our model of QFAs. This may lead 
to a claim that QFAs are more powerful than classical reversible automata because they can 
do such probabilistic choices. We wish to avoid such situations and to separate probabilistic 
choices from real quantum effects. 

Therefore, we define 1-way finite automata with probabilistic choices (PRFAs) and com- 
pare capabilities of QFAs with them. A PRFA is a probabilistic finite automaton such that, 
for any state q\ and any a G T, there is at most one state q 2 such that the probability of 
passing from q 2 to q\ after reading a is non-zero. Definitions of acceptance and rejection 



are similar to QFAs and RFAs. Now, the probabilistic automaton from the example above 
becomes a 1-way PRFA. 

Theorem 1 1. If a language is accepted by a 1-way RFA, it is accepted by a 1-way PRFA. 

2. If a language is accepted by a 1-way PRFA, it is accepted by a 1-way QFA with the 
same probability of correct answer. 

Proof: Easy. □ 

In section we will compare the power of 1-way QFAs and PRFAs and show that 1-way 
quantum automata can actually do more than just probabilistic choices. 



3 Capabilities of RFAs and QFAs 

3.1 QFAs with probability of correct answer above 7/9 

We characterize the languages recognized by 1-way QFAs in terms of their minimal automata. 
The minimal automaton of a language L is a 1-way deterministic finite automaton recognizing 
it with the smallest number of states. (Note: the minimal automaton can be non-reversible, 
even for some languages L that can be recognized by a 1-way RFA. The extreme case of this 



is our Theorem [12| where the smallest 1-way RFA is exponentially bigger than the minimal 
nonreversible automaton.) It is well known that the minimal automaton is unique and 
can be effectively constructed. 

Theorem 2 Let L be a language and M be its minimal automaton. Assume that there is a 
word x such that M contains states q±, q 2 satisfying: 

1- qi ^ <?2, 

2. If M starts in the state q\ and reads x, it passes to q 2 , 

3. If M starts in q 2 and reads x, it passes to q 2 , and 

4- <?2 is neither "all - accepting" state, nor "all - rejecting" state. 

Then L cannot be recognized by a 1-way QFA with probability at least 7/9 + e for any fixed 
e > 0. 



Proof: We prove the result for a slightly smaller probability of correct answer 5/6 + e 
(instead of 7/8 + e). The proof for 7/8 + e is technically more complicated. 

Let L be a language such that its minimal automaton contains the "forbidden construc- 
tion" and M be a QFA. We show that, for some word y the probability of M giving the 
correct answer to "y G L?" is less than 5/6 + e. This implies that L cannot be recognized 
with probability of correct answer being 5/6 + e. 



Figure 1: "The forbidden construction" of Theorem 2. 



For simplicity, we assume that q x is the starting state of M. We introduce some notation. 
Let Pnon(V') be the non-halting part of if) and Phait(i>) be the halting part of if). V' a — PnovVa 
is a transformation that maps if) to the non-halting part of V a (if)). If x is a word consisting 
of letters a x . . . a&, then V' x denotes V' ak . . .V ai . ip x denotes the non-halting part of the QFA's 
configuration after reading x. It is easy to see that, for any word x and letter a, if) xa = V^(if) x ). 

We recall that l 2 (Q) denotes the state-space of M with l 2 norm ||-0||. l 2 (Q) = Pace © 
E re j © E non . 

First, we prove that the state-space of M can be decomposed into two parts with different 
behavior. 

Lemma 1 There are subspaces E\, E 2 such that E non = E\ © E 2 and 

(i) IfipeEx, then V x {if)) G E x , 

(ii) If if) G E 2 , then \\V^ k (VO || — * when k — > oo. 

Proof. We define two sequences of subspaces E\, Ej, . . . and E 2 , E 2 , . . . such that E non = 
E\ © E 2 . Let E\ = {if>\if> G E non and V a {ip) G E non \ (i.e., the subspace of all if) such that 
both if) and V a (if)) have only non-halting components). E\ consists of all vectors in E non 
orthogonal to E\. For i > 1, E\ = E 1 ^ 1 fl {if)\V a {if)) G -E^ 1 } and .E, consists of all vectors in 
E non orthogonal to E\. 

Clearly, E\ D E\ D . . .. If is a proper subspace of E[, then the dimensionality of 
E{ +1 is smaller than the dimensionality of E\. This can happen only finitely many times 
because the original E\ is finite-dimensional. Hence, there is z'o such that E[ a = E{ a+1 . We 
define E\ = E[°, E 2 = E 2 °. Next, we check that both (i) and (ii) are true. 

Let if) G E x . Then, V a (ip) G E[ 0+1 = E[° = E x and V a (ip) G E non by E x C and the 
definition of It remains to prove that E 2 satisfies (ii) condition of Lemma [I]. 

Claim 1 If if) e El then P ha it{V a {V^{if)))) = t for all I < % - 1. 

Proof: By induction. If % = 1, then P/ i a^t(K l ('0)) = "0* by definition of E\. If z > 1 and 
I = 0, then P ha it{V a {if))) = t because £j C E\. 

The only remaining case is % > 1 and I > 1. Let if) 1 = V a (if>). By definition of E\, V a (ifi) G 
Sj -1 . We also have V a (ip) G E non because El" 1 C £ non . Hence, 1^(0) = ^n O n(K(V0) = 
V;(^) = V' and 

^(KW^))) = Phalt{Va{Vyi{lp'))) = t 

by if)' G and inductive assumption. □ 



Claim 2 Let ijj = ip 1 + ip 2 , ipi e E\ , ip 2 e i£| . T/ien, for all I < i — 1, 

Pfialt 

Proof: By linearity of P^, K, V^, 

n*(v.WW)) = JWK(V£ (&))) + JWWSCtfe,))). 

Claim I implies that P^(K(K^i))) = D 

Claim 3 Let j G {1, . . . , i }. There is a constant 5j > such that for any ip £ E 2 there is 
/g{0,...,j-1} such that \\P halt (V a (V^)))\\ >SjU\\- 

Proof: By induction. 

Base Case. Consider the linear transformation T x : E\ — > P acc © P re j that maps ip & E to 
the halting part of V^(^>). ||Tx|| (the norm of T\) is the minimum of ||Ti(a;)|| over all x such 
that ||x|| = 1. 

If ||Ti|| = 0, there is x G E\ such that ||x|| = 1 but ||Ti(x)|| = 0. This means that 
T\{x) = , implying that x G E\. However, E\ fl E\ = { }, leading to a contradiction. 
Hence, ||Ti|| > 0. Also, ||Ti|| < 1 because V a is unitary and projection to the halting subspace 
can only decrease the norm. 

We take 6± = \\Ti\\. Then, the halting part of V a (ip) is at least 1 1 T\ 1 1 \\ip\\ = ^iH^II- 

Inductive Case. We assume that the lemma is true for E 2 and prove it for E 2 +1 . 

We consider the transformation T i+ i mapping ip G E 2 +1 to the projection of V a (ip) on 
El If T i+1 {ip) = , then ip G E\ +l by the definition of E\ +1 . Similarly to the previous case, 
E{ +1 n E l 2 +1 = {If}. Hence, if T i+1 (ip) = If and ip G E l 2 +1 , then ip = If . This means that 
||T i+ i|| > 0. We can also prove that ||Pi+i|| < 1. 

We define 5 i+1 = 

Let El = {x G E l 2 +l and x±E l 2 }. Then, E l 2 +1 = E l 2 © E\. We also note that E\ is a 
subspace of E\. (This follows from definitions of E\ and E\.) 

To show that one of P/i a zt(V^(V^f(^))) is large enough, we represent ip G E 2 +1 as ip 2 + ip 3 , 
ip 2 G E\, ip 3 G E\. There are two cases: 

1. HV2II > 
Then, 

for some I < % — 1 because ^2 G -E2 an d we can a PPly the inductive assumption. Claim 
implies that this is also true for PhaitiYaiYa 1 ^))) ■ 



2. H^ll < 

Then, by triangle inequality, 

IhM > \M\ - \\M > (i - ^f^)U\\ > ^f^Ul 

Let ip', ip2 and i/)' 3 be the projections of V a (ip), V a (ip2), Vaifa) on E l 2 . Clearly, ip' = 
ip' 2 + ip' 3 . Triangle inequality gives us 

\m\ > \m\ - \m\ > \m\ - > ^f%n - %%ii = £f%n- 

We have ip' = P non (ip') + PhaitijP')- Again, we have two cases. 

If \\Phaitm\\ > then \\P h M')\\ > S i+1 U\\ because 8 i+1 = and 

5i < 1 because all ||Tj|| are at most 1. 

Otherwise, by triangle inequality, \\P n0 n(f)\\ > WW ~ II^WVOII > By 
inductive assumption, there is I < i — 1 such that H-Pha/tKtV^ (?//)) II > ^H^'H- There- 
fore, 

\\p h ait(v a (v; l+1 w))\\ > = s i+1 w\. 

□ 

Claim 4 There is c such that < c < 1 and, for any i/j G E 2 , t E IN, ||y^ ot (V>)|| < c*||^||- 

Proof: We take c = J 1 — 5f . 

By Claim |, one of P h ait{V a {V^{i)))) is at least 5 io ||V>||- Pnon{V a {V^{i)))) is orthogonal to 
this vector. Hence, PnoniyaiYa 1 ^))) lii a ^ mos t 

V^ iQ can be only smaller because V a is unitary and P non does not increase vectors. 
We have shown that (■?/>) < c||^||. Repeating this t times, we obtain Claim f|. □ 
Clearly, c* — > if t — ► oo. This completes the proof of Lemma [I]. □ 
Let -?/y = ^ + V'i £ Pi- We consider two cases. 

Case 1. Wj\ > 1/V3. 

Then, ||<0i|| < \/l - (1/V3) 2 = 2/y/3. This also means ||^(V>i)|| < 2/y/3. For suffi- 
ciently large i, 11^(^3)11 becomes negligible (part (ii) of Lemma [l]). Then, HVy^ll < 2/v3- 

The probability of M halting after this moment is less than 2/3. Hence, M has already 
halted with probability more than 1/3 and accepted (or rejected) with probability more 



than 1/6. This means that M cannot reject (accept) any continuation of x l with probability 
5/6. However, x % has both continuations in L and continuations not in L. Hence, M does 
not recognize L. 
Case 2. ||^|| < 

qx and q2 are different states in the minimal automaton of L. Therefore, there is a word 
y £ E* such that ?/ leads to acceptance from one of gi,g2 but not from the other one. We 
consider the distributions of probabilities on M's answers "accept" and "reject" on y and 
x l y. On one of these words, M must accept with probability at least 5/6 + e and reject 
with probability at most 1/6 — e. On the other word, M must accept with probability most 
1/6 — e and reject with probability at least 5/6 + e. Therefore, both the probabilities of 
accepting and the probabilities of rejecting must differ by at least 2/3 + 2e. This means 
that the variational distance between two probability distributions (the sum of these two 
distances) must be at least 4/3 + 4e. We show that it cannot be so large. 

First, we select an appropriate i. Let m be so large that ||V^m(^)|| < 5 for 5 = e/4. 

V x (ip^), V^ip^), ... is a sequence in a finite-dimensional space. Therefore, it has a limit 
point and there are i,j such that 

\Kj(^)-v^(^)\\<5. 

We choose i, j so that i > m. 

The difference between two probability distributions comes from two sources. The first 
is difference between ipj and ^A xi (the states of M before reading y). The second source is 

the possibility of M accepting while reading x l (the only part that is different in two words). 
We bound the difference created by each of these two sources. 
The difference ipj — ipj . can be partitioned into three parts. 



- ^ = - ^) + (^ - + - 

The first part is — ip\ = ip^ and HV'SII < The second and the third parts are both 

small. For the second part, notice that V' x is unitary on E x (because V x is unitary and V x (ip) 
does not contain halting components for ip £ Ei). Hence, V x preserves distances on E\ and 

l^-^JI = ll^-^||<5. 

The third part is ip^ xi — ip\ . = i\ri . and \\ip 2 ^ .|| < 8 because i > m. 

Next, we state two lemmas relating differences between two superpositions and differences 
between probability distributions created by observing these superpositions. The first lemma 
is by Bernstein and Vazirani[|J. 

Lemma 2 /TJ/ Let ip and <fi be such that \\ip\\ < 1, ||0|| < 1 and — <p\\ < e. Then the total 
variational distance resulting from measurements of and ip is at most 4e. 



The second lemma is our improvement of lemma |2|. 



Lemma 3 Let tp 1 and ip 2 be such that ?/> 1 _L?/> 2 . Then the total variational distance resulting 
from measurements of ip 1 and ip 1 + ip 2 is at most 



2112 



u 2 \\^wm\ 2 + \\^ 

Proof. Omitted. □ 

We apply lemma [3] to ip\ and ip 2 ,. This gives that the variational distance between 
distributions generated by tp^ and ip~^ + ip 2 , is at most 1. Then, we apply lemma |2| to two 
other parts of ipj — ipJ xi - Each of them influences the variational distance by at most 45. 
Together, the variational distance between distributions obtained by observing ipj and if)j xi 
is at most 1 + 85. 

The probability of M halting while reading x l is at most ||^|| 2 = 1/3. Adding it 

increases the variational distance by at most 1/3. Hence, the total variational distance is at 
most 4/3 + 85 = 4/3 + 2e. However, if M distinguishes y and x % y correctly, it should be at 
least 4/3 + 4e. Hence, M does not recognize one of these words correctly. □ 

Theorem 3 Let L be a language and M be its minimal automaton. If M does not contain 
the "forbidden construction" of Theorem || then L can be recognized by a 1-way reversible 
finite automaton. 



Proof. We define a non-reversibility as a tuple (qi,q2,g,a) where qi,q2,q G Q, a G S, 
qi 7^ <?2 and reading a in q\ or q2 leads to q. Let m be the number of non-reversibilities in M. 
We show how to modify M so that the number of non-reversibilities decreases. A reversible 
automaton is obtained by repeating this modification several times. 

We define a partial ordering < on non-reversibilities. (qi,q2,q,a) < (q[, q^ ■, a ') if an d 
only if one of q[ and q' 2 is reachable from q. It is easy to see that < is transitive. 

Lemma 4 < is anti-reflexive. 

Proof. For a contradiction, assume there is (qi, q2, q, a) such that (qi, q2, q, a) < (qi, q2, q, a). 
We also assume that q2 is reachable from q by reading a word y. (Otherwise, q\ is reachable 
from q and we can just exchange q\ and q2-) Then, reading x = ay leads from qi to and 
from q 2 to q2- This contradicts our assumption that M does not contain such q±,q2- d 

Hence, there is a tuple (qi,q2,q,a) that is maximal with respect to <. We create two 
copies for state q and all states reachable from q. If M reads a in qi, it passes to one copy 
of q, if it reads a in q2, it passes to the second copy. This eliminates this non- reversibility. 
Other non-reversibilities are not dublicated because they are not reachable from q. Hence, 
the number of non-reversibilities is decreased. □ 



Corollary 1 A language can be recognized by a 1-way QFA with probability 7/9 + e if and 
only if it can be recognized by a 1-way reversible finite automaton. 

Proof. Clearly, a RFA is a special case of a QFA. The other direction follows from Theorems 
§andg. □ 

This immediately implies the same result about 1-way reversible automata with proba- 
bilistic choices. For this type of automata, a stronger result can be proved. 

Theorem 4 A language can be recognized by a 1-way PRFA with probability 2/3 + e (for 
arbitrary e > 0) if and only if it can be recognized by a 1-way reversible finite automaton. 

Proof: Omitted. □ 



The example in Section 2.3 shows that Theorem is tight. 



3.2 QFAs with probability of correct answer below 7/9 

For smaller probabilities, QFAs are slightly more powerful than RFAs or even PRFAs. 

Theorem 5 The language a*b* can be recognized by a 1-way QFA with the probability of 
correct answer p = 0.68... where p is the root of p 3 + p = 1. 

Proof. We describe a 1-way QFA M accepting this language. The automaton has 4 states: 
?o,?i,?acc and q rej . Q acc = {q acc }, Q rej = {q re j}- The initial state is a/1 - p\q ) + y/p\qi). 
The transition function is 

V a {\q )) = (1 -p)ko> + yjp{l-p)\qi) + s/p\q rej ), 

K(\qi}) = \Jp(i-p)\qo) +p\qx) - >Ji-p\qrej), 

Vb(\qo)) = \q re j), Vb(\qi)) = \qi), 

m\qo}) = \q r ej), v$(\qi)) = ka CC )- 

Case 1. The input is x = a*. 

It is straightforward that 5 maps \/\ — p\q^) + \/p\qi) to itself while it receives a from 
the input. Hence, after reading a* the state remains y/1 — p\qo) + y/p\qi) and, after reading 
the right endmarker, it becomes a/1 — p\q re j) + \Zp|?occ}- This means that the automaton 
accepts with probability p. 
Case 2. The input is x = a*b + . 

Again, the state remains a/1 — p\qo) + y/p\qi) while input contains a. Reading the first b 
changes it to a/1 — p\q re j) + y/p\qi)- The non-halting part of this state is ■Jp\q\)- It is left 
unchanged by next 6s and mapped to \q aC c) after reading the right endmarker. Again, the 
accepting probability is p. 
Case 3. The input is x a*b*. 



Then, the initial segment of x is a*b + a + . After reading the first b, the state is y/1 — p\q re j) + 
y/p\qi). The automaton rejects at this moment with probability (1— p). The non-halting part 

y/p\qi) is mapped to py/1 - p\q ) + (1 -p)y/p\qi) - \Jp{l - p)\q re j) by the next V a . Then, the 
automaton rejects with probability p(l—p). The non-halting part py/1 — p|go) + (l~ p)y/p\qi) 
is unchanged by as. However, either b or right endmarker follows as and then go is mapped 
to \q re j) and the automaton rejects with probability p 2 {l — p). We add the probabilities of 
rejecting at different moments together and get that M rejects x a*b* with probability at 
least 

(1 -p) +p(l -p) +p 2 (l -p) = {I + p + p 2 ){l - p) = 

(1 ~p) = 1 -P 3 = P. 

1 — p 

□ 

It is easy to see that the minimal automaton of a* b* contains the "forbidden construction" 
of Theorem 0. Therefore, we have 

Corollary 2 There is a language that can be recognized by a 1-QFA with probability 0.68... 
but not with probability 7/9 + e. 

Proof: Follows from Theorems [| and |5|. □ 

For probabilistic computation, the property that the probability of correct answer can be 
increased arbitrarily is considered evident. Hence, it was not surprising that [|12|] wrote "with 
error probability bounded away from 1/2" about QFAs, thinking that all such probabilities 
are equivalent. However, mixing reversible (quantum computation) and nonreversible (mea- 
surements after each step) components in one model makes it impossible for QFAs. It is 
open whether a counterpart of Corollary |2| is true for 2-way QFAs. 

Corollary 3 There is a language that can be recognized by a 1-QFA with probability 0.68... 
but not by a classical 1-way reversible FA. 

This corollary can be improved by showing that even a 1-way probabilistic reversible 
automaton cannot recognize this language (and even with probability 1/2 + e). 

Theorem 6 Let L be a language and M be its minimal automaton. Assume that there are 
words x,y and M's states qi,q2 such that 

1. none of q\ and q2 is " all- accepting" or "all-rejecting" state; 

2. reading x in q\ leads to q\; 

3. reading y in gi leads to g2,' 



4- reading y in g 2 leads to g2,' 



5. there is no i > such that reading x % leads from q 2 to q 2 . 
Then L cannot be recognized by a 1-way PRFA with probability 1/2 + e, for any e > 0. 

Proof. Without the loss of generality, we assume that q\ is the starting state of M. Let M p 
be a 1-way probabilistic reversible automaton. We are going to show that, for some word x, 
the probability of M p giving the right answer on the input x is less than 1/2 + e. 

Lemma 5 For any state q and a G S + ; there is k such that < k < \Q\ and, for any 
sequence of probabilistic choices, one of the following happens: 

1. After reading a k in state q, M p returns to q; 

2. After reading a' Q ' +1 in state q, M p accepts or rejects. 

Proof. Let qo,qi,... be any sequence of non-halting states such that go = g and the 
probability that reading a causes M p to go from qi to g^+i is non-zero. If the length of the 
sequence is greater than |Q|, then some state appears twice in this sequence. We consider 
the first state which appears twice. If it is not g , then it has two preceding states: the state 
preceding it when it appears in the sequence for the first time and the state preceding it 
when it appears in the sequence for the second time. This contradicts the definition of a 
probabilistic reversible automaton. We have shown that go is the first state which appears 
twice. 

Next, assume we have two such sequences: qo,qi,... and q' ,q[,.... Let ki, k 2 be the 
smallest numbers such that ki > 0, q kl = g and k 2 > 0, q' k2 = q' , respectively. We show 
that k\ = k 2 . For a contradiction, assume that k\ > k 2 (k 2 > k\ case is similar.). Then, 
9fci-i = <?fc 2 -i (because the state q kl = q' k2 cannot have two preceding states), q kl ~ 2 = q' k2 - 2 
and so on, q kl - k2 = q' k2 - k2 = q'o — Qo- This contradicts the assumption that k\ is the smallest 
number such that k\ > and q kl = go- Hence, k\ = k 2 . □ 

Lemma 6 For any state q and a G S + ; one of the following happens: 

1. There is k such that, after reading a k in state q, M p returns to q for any sequence of 
probabilistic choices; 

2. The probability of halting after reading a k in state q tends to 1 when k — > oo. 

Proof. Let k be as in Lemma |5|. If M p always returns to g after reading a k , Lemma || is 
true. It remains to consider the case if there is a sequence of probabilistic choices for which 
M p does not return to q. Then, by Lemma §, this sequence causes M p to halt. Let p be the 
probability of returning to q after reading a k . Then, the probability of returning to q after 
reading a lk is p % . With probability 1 — p l , M p does not returns to q at some moment and 
(this is the only alternative) terminates after reading a ,fc+ '^' (or some its prefix). Clearly, 
1 — p l — > 1, if i — > oo. □ 



We note that one can use the same k — \Q\\ for all q and a. (For any q,a, k < \Q\ and 
\Q\\ is a multiple of any such k.) We shall call the states of the first type return states for a. 

Let pi be the probability of non-halting after reading x l and p = limi_ KX) pj. We select 
i so that \p — Pi\ < e. Let p'j be the probability of non-halting after reading x % yi and 
p' = linij^oo^. We select j so that j is a multiple of \Q\\ and \p' — p'j\ < e. 

Next, we compare the behaviour of M p on x l yi and x % y^x^ . These words correspond 
to different states in the minimal automaton. Hence, there is a continuation z such that 
exactly one of x l y^z and x % y^x^z is in L. 

If M p had accepted or rejected after x % yi (without seeing the right endmarker), it accepts 
(rejects) both x l y^z and x l y^x^z. It remains to consider the sequences of probabilistic 
choices where M p does not accept until x l yK 

Let q x be the state of M p after reading x % . We consider three cases: 

1. q x is not a return state for x. 

Then, reading more x's cause M p to halt with probability 1. However, the probability 
of halting after reading more than i x's is less than e (by the definition of i). Hence, 
the probability of this case is less than e. 

2. q x is not a return state for y. 

Then, reading y's cause M p to halt with probability 1. If it does not happen before 
reading y\ it happens later with probability 1. The definition of j implies that, if M 
does not halt before reading y\ then the probability of it halting later is less than e. 
Hence, the probability that q x is not a return state and M p does not halt before reading 
x l yi is less than e. 

3. q x is return state for both x and y. 

Then, reading yi causes M p to return to q x because j is a multiple of \Q\\ and reading 
x'^' causes it to return to q x as well. In both cases, it is in the same state after reading 
x l yi and after reading lyx^l and, hence, does the same thing on both x % y^z and 
x l yix^z. 

We see that the third case causes M p to react similarly on x l y^z and x % y^x [C ^'z and the prob- 
ability of the other two cases together is less than 2e. Hence, the probabilities of accepting 
these two words differ by less than 2e. However, one of them is in L and must be accepted 
with probability 1/2 + e and the second is not in L and must be accepted with probability 
at most 1/2 — e. This means that M p does not recognize L with probability 1/2 + e. □ 

The "forbidden construction" of Theorem |6] is also present in the minimal automaton of 
a*b*. Therefore, we have 

Corollary 4 There is a language that can be recognized by a 1-QFA with probability 0.68... 
but cannot be recognized by a 1-PRFA with probability 1/2 + e, for any e > 0. 



We do not know whether all languages with minimal automata not containing the con- 
struction in Theorem |6] can be recognized by 1-way PRFAs. Another open question is 
characterizating the languages recognized by 1-way QFAs in terms of "forbidden construc- 
tions". 



4 Complexity 

4.1 Divisibility by a prime 

All previous work on 1-way QFAs and the previous sections of this paper) considers 



the question what languages can be recognized by quantum automata. However, there 
is another interesting and important question: how efficient are QFAs compared to their 
classical counterparts? 

For 1-way finite automata, the most natural complexity measure is the number of states 
in the automaton. We can follow the proof in [12j that any language recognized by a 1-way 
QFA is regular step by step and add complexity bounds to it. Then, we get 

Theorem 7 Let L be a language recognized by a 1-way QFA with n states. Then it can be 
recognized by a 1-way deterministic automaton with 2°^ states. 

So, transforming a QFA into a classical automaton can cause an exponential increase in 
its size. Our next results show that, indeed, 1-way QFAs can be exponentially smaller than 
their classical counterparts. 

Let p be a prime. We consider the language L p = {a l \i is divisible by p}. It is easy to see 
that any deterministic 1-way finite automaton recognizing L p has at least p states. However, 
there is a much more efficient QFA! 

Theorem 8 For any e > 0, there is a QFA with O(logp) states recognizing L p with proba- 
bility 1 — e. 

Proof. First, we construct an automaton accepting all words in L with probability 1 and 
accepting words not in L with probability at most 7/8. Later, we will show how to increase 
the probability of correct answer to 1 — e for an arbitrary constant e > 0. 

Let Uk, for A; e {1, . . . , p — 1} be a quantum automaton with a set of states \Q\ = 
{<?o, ?i, Qacc, Qrej}, a starting state \q ), Q acc = {q acc }, Qrej = {Vrej}- The transition function is 
defined as follows. Reading a maps |g ) to cos0|g o )+^sin0|g 1 ) and \qi) to i sin <p\q )+cos(f)\qi) 
where (ft = (It is easy to check that this transformation is unitary.) Reading the right 
endmarker $ maps \q ) to \q aC c) and |gi) to \q re j)- 

Lemma 7 After reading a\ the state of Uk is 

cos I — — I \q ) +2 sin I — — I 



Proof. By induction. □ 

If j is divisible by p, then is a multiple of 2ir, cos(^^) = 1, sin(^^) = 0, reading a j 
maps the starting state \qo) to \qo) and the right endmarker $ maps it to \q acc )- Therefore, 
all automata Uk accept words in L with probability 1. 

For a word a J ^ L, call Uk "good" if Uk rejects a J with probability at least 1/2. 

Lemma 8 For any a j L, at least (p — l)/2 of all U k are "good". 

Proof. The superposition of Uk after reading a? is cos(^^)|g ) + s i n (^p)|<?i)- This is 
mapped to cos(^^)|g acc ) + sin(^^)|g re j) by the right endmarker. Therefore, the probability 
of U k accepting a* is cos 2 (^). cos 2 (^) < 1/2 if and only if |cos(^)| < l/y/2. This 
happens if and only if ^ is in [2nl + tt/4, 2tt/ + 3tt/4] or in [2nl + 5tt/4, 2tt/ + 7tt/4] for 
some / G IN. 

27r0fcmod P ) e ^ 37r ^ 4 ^ if and Qnly if 2^fc G [27r/+7r/4, 27r/+37r/4] for some I. pis a prime 
and j is relatively prime with p. Therefore, j mod p, 2j mod p, . . ., (p — l)j mod p are just 
1, 2, . . ., p — 1 in different order. Hence, it is enough to count k such that — G [vr/4, 3n/A] 

or 2^e[57r/4,77r/4]. 

We do the counting for p = 8m + 1. (Other cases are similar.) Then ^ e [7r/4,37r/4] 
if and only if m + 1 < k < 3m and E [5tt/4, 7tt/4] if and only if 5m + 1 < fc < 7m. 
Together, this gives us 4m = (p — l)/2 "good" k's. □ 

Next, we consider sequences of |~81np] fc's. A sequence is good for a J if at least 1/4 of all 
its elements are good for a J . 

Lemma 9 There is a sequence of length [8 hip] which is good for all a J ^ L. 

Proof. First, we show that at most 1/p fraction of all sequences is not good for any fixed 
a? i L. 

We select a sequence randomly by selecting each of its elements uniformly at random 
from {1, . . . ,p — 1}. The probability of selecting a good k in each step is at least 1/2. By 
Chernoff bounds, the probability that less than 1/4 = 1/2 — 1/4 fraction of all elements is 
good is at most 

e -2(l/4) 2 81np = 1 
P 

Hence, the fraction of sequences which are bad for at least one j G {1, 2, ... ,p — 1} is 
at most (p — l)/p and there is a sequence which is good for all j G {1, . . . ,p — 1}. This 
sequence is good for a- 7 ^ L with j > p as well because any Uk returns to the starting state 
after reading a p and, hence, works in the same way on a j and a j mod P. □ 

Next, we use a good sequence k±, . . . , k^inp] to construct a quantum automaton recogniz- 
ing L p . The automaton consists of U^, Uk 2 , ■ ■ ., £4 r81npl and a distinguished starting state. 
Upon reading the left endmarker c 7 , it passes from the starting state to a superposition where 
| go) states of all Uk t have equal amplitudes. 



Words in L are always accepted because all Uk accept them. For any a- 7 ^ L, at least 1/4 
of the sequence is good. This means that at least 1/4 of all reject it with probability at 
least 1/2 and the total probability of rejecting any a? ^ L is at least 1/8. 

Finally, we sketch how to increase the probability of correct answer to 1— e for an arbitrary 
e > 0. We do it by increasing the probability of correct answer for each Uk- 

Namely, we consider an automaton U' k with 2 d non-halting states where d is a constant 
depending on the required probability 1 — e. The states are labelled by strings of 0s and Is 
of length d: go...oo; <?o...oi an d so on. The starting state is the state go...oo corresponding to the 
all-0 string. The transition function is defined by 

d 

$(q Xl ...x d ,a,q yi ... yd ) = Y[S(q Xl ,a,q yi ). 

3=1 

It is easy to see that this is just the tensor product of d copies of Uk- The automaton also 
has one accepting state and 2 d — 1 rejecting states. After reading the right endmarker, the 
automaton passes to the accepting state from go...oo and to a rejecting state from any other 
state q Xl ...x d - (To ensure unitarity, one-to-one correspondence between q xl ... Xd and rejecting 
states is established.) A counterpart of Lemma [7] is 

Lemma 10 The state of U' k after reading a? is 

. [2njk\. . .. f27Tjk\. .. . f2irjk\. . .. (2wjk\ . .. 
(cos I I |g ) + 2sml I |gi)j <g> . . . <g> (cos I I \q }+ism I I |gi)J. 

d times 



The amplitude of |go) ® ■ ■ ■ ® |<?o) = 9o...o in this superposition is cos d (^^). If j is a 
multiple of p, then this is 1, meaning that words in L p are always accepted. For a J ^ L p , we 
call U' k 5-good if it rejects a J with probability at least 1 — 5. We formulate a counterpart of 
Lemma [8[ 

Lemma 11 For a suitable constant d, at least 1 — 5 of all U' k are 5-good. 

Then, we define a 5-good sequence of automata as a sequence such that, for any a J ^ L, 
at least 1 — 25 of all automata in the sequence are 5-good. Similarly to Lemma |9|, we show 
that there is a 5-good sequence U' kl , U' k2 , ... of length O(logn). Then, we consider an 
automaton consisting of U' k , U' k2 , . . . and a distinguished starting state. Upon reading the 
left endmarker c 7 , it passes from the starting state to a superposition where |go) states of 
all UL have equal amplitudes. Again, it accepts a J G L p with probability 1 because all U' k 
accept a J e L p . Words a J ^ L p are rejected by at least 1 — 25 of UL with probability 1 — 5. 
Therefore, the probability of rejecting any a J ^ L p is at least (1 — 25) (1 — 5) > 1 — 35. Taking 



5 = e/3 and choosing d so that it satisfies Lemma 11 completes the proof. □ 



We have shown an exponential gap between deterministic and quantum 1-way finite 
automata. Next, we compare quantum and probabilistic finite automata. Generally, proba- 
bilistic finite automata can recognize some languages with the number of states being close to 
the logarithm of the number of states needed by a deterministic automaton [[l], [7|. However, 
this is not the case with L p . Here, adding probabilism does not help to decrease the number 
of states at all. 

Theorem 9 Any 1-way probabilistic finite automaton recognizing L p with probability 1/2 + e, 
for a fixed e > 0, has at least p states. 

Proof. Assume that there is a 1-way probabilistic finite automaton with less than p states 
recognizing L p with probability \ + e, for a fixed e > 0. Since the language L p is in a single- 
letter alphabet, the automaton can be described as a Markov chain. We use the classification 
of Markov chains described in Section 2 of [[Hj . According to this classification, the states of 
the Markov chain (the automaton) are divided into ergodic and transient states. An ergodic 
set of states is a set which cannot be left once it is entered. A transient set of states is a 
set in which every state can be reached from every other state, and which can be left. An 
ergodic state is an element of an ergodic set. A transient state is an element of a transient 
set. 

If a Markov chain has more than one ergodic set, then there is absolutely no interaction 
between these sets. Hence we have two or more unrelated Markov chains lumped together. 
These chains may be studied seperately. If a Markov chain consists of a single ergodic set, 



the chain is called an ergodic chain. According to results in Section 2 of [|1(]] , every ergodic 
chain is either regular or cyclic. 

If a Markov chain is regular, then sufficiently high powers of the state transition matrix 
P of the Markov chain are with all positive elements. Thus no matter where the process 
starts, after sufficient lapse of time it can be in any state. Moreover, by Theorem 4.2.1 of 
TOfl there is limiting vector of probabilities of being in the states of the chain, not dependent 



of the initial state. 

If a Markov chain is cyclic, then the chain has a period d, and its states are subdivided 
into d cyclic sets (d > 1). For a given starting position, it moves through the cyclic sets in a 
definite order, returning to the set of the starting state after d steps. Hence the ci-th power 
of the state transition matrix P describes a regular Markov chain. 

We have assumed that p is prime, and the automaton has less than p states. Hence for 
every cyclic state of the automaton the value of d is strictly less than p, and because of 
primality of p, d is relatively prime to p. By D we denote the least common multiple of all 
such values d. Hence D is relatively prime to p, and so is any positive degree D n of D. Since 
l Dn (£ M p but 1 D " P e M p , the total of the probabilities to be in an accepting state exceeds 



\ + e for 1°" and is less than | — e for \ d "p. Contradiction with Theorem 4.2.1 of [10|. □ 



Corollary 5 For the language L p , the number of states needed by a classical (deterministic 
or probabilistic) 1-way automaton is exponential in the number of states of a 1-way QFA. 



Proof: Follows from Theorems ^| and |9|. □ 



4.2 Equality 

Divisibility by a prime is quite natural problem and we expect that our algorithm can be 
used as a subroutine, making other quantum algorithms more space-efficient. Here, we show 
how to use our quantum automaton for another problem as well. This problem is checking 
whether the length of the input word is equal to some constant n. 

Theorem 10 jffl Let L' n be a language consisting of one word a n in a single-letter alphabet. 

1. Any deterministic automaton that recognizes L' n has at least n states. 

2. For any e > 0, there is a probabilistic automaton with 0(log 2 n) states recognizing L' n 
with probability 1 — e. 

The first part is evident. To prove the second part, Freivalds0 used the following con- 
struction. O(^L^) different primes are employed and O(logn) states are used for every 
employed prime. At first, the automaton randomly chooses a prime p, and then the remain- 
der modulo p of the length of the input word is found and compared with the standard. 
Additionally, once in every p steps a transition to a rejecting state is made with a "small" 
probability con ^ tp . The number of used primes suffices to assert that, for every input of length 
less than n, most of primes p give remainders different from the remainder of n modulo p. 
The "small" probability is chosen to have the rejection probability high enough for every 
input length N such that both N ^ n and an e-fraction of all the primes used have the same 
remainders modp as n. 

This 1-way probabilistic automaton is reversible according to the definition of section ^. 
We can use Theorem [I] to transform it into quantum automaton with the number of states 



increasing at most twice. Then, we obtain a counterpart of Theorem [10] for quantum case. 

However, we can do better by counting modulo prime as in Theorem |[ For that, we need 
O(logp) states for each prime p (instead of p states in the probabilistic case). Each prime 
p is O(logn) and there are O (log nj log log n) of them. Therefore, the number of states in 
quantum case will be 

0( lQgn logp) = 0{ - lQgn log log n) = O(logn). 
log log n log log n 

We have shown 

Theorem 11 L' n can be recognized by a 1-way QFA with O(logn) states. 



Again, the QFA is exponentially smaller than the corresponding deterministic automaton. 



4.3 Are QFAs always space-efficient? 



Subsections |4.1| and |4.2| showed cases when 1-way QFAs are more space-efficient than their 
classical counterparts. There can be examples of different kind where deterministic finite 
automata are exponentially smaller than 1-way QFAs. The construction of theorem [| which 
transforms the minimal automaton into a 1-way RFA can increase the size of the automaton 
exponentially. The next theorem shows that this is inevitable. 

Theorem 12 Let L m = (xy\zy) m U {(xy\zyYxx\0 < i < m — 1}. Then, 

1. L m can be recognized by a 1-way deterministic finite automaton with 3m + 2 states; 

2. L m can be recognized by a 1-way reversible automaton but it requires at least 3(2 m — 1) 
states. 

After first version of this paper appeared, Ambainis, Nayak and Vaziranif^] showed that, 
for a different language, the number of states needed by a 1-way QFA is almost exponentially 
bigger than the number of states of a 1-way deterministic finite automaton. 



5 Modifications of 2-way QFAs 

The advantage of 1-way quantum automata is the simplicity of this model. However, we saw 
that 1-way automata are quite limited in several situations (despite being good in others) 
while [0 shows that 2-way QFAs are strictly more powerful than classical finite automata. 
It would be interesting to come up with a model having both advantages, i.e. being both 
powerful and simple. In the remainder, we propose several modifications of quantum au- 
tomata which are intermediate between 1-way QFAs and 2-way QFAs. Quantum part is kept 
finite in all of these models. Questions about exact power of these models are mostly open 
but we have shown that, in most of these models, all regular languages can be recognized 
and, in at least one of them, non-regular languages can be recognized as well. 



5.1 Scanning the tape multiple times 

The simplest modification is to allow a 1-way QFA to scan its input tape several times 
(after the right endmarker it goes to the left endmarker and so on). This is enough to make 



the proof from [12| that 1-way QFAs recognize only regular languages fail. If we allow the 



automaton to reject words by non-halting, a nonregular language can be recognized. 

Theorem 13 Let L = {a n b n \n e IN}. There is a 1-way QFA M scanning tape several times 
such that 



1. If x L, M stops with probability 1 after 0(\x\) scans of the tape. 



2. If x <E L, M never stops. 



If we require M to stop in a rejecting state for rejection, a similar question is still open. 
It is also open whether multiple scans of the tape can be used by a 1-way QFA to recognize 
an arbitrary regular language. (However, known proofs that 1-way QFAs do not recognize 
some regular languages also fail in this case.) 



5.2 Passing information back to environment 

Another possibility is introducing more complicated observables. We can partition all non- 
halting states into 2 or 3 classes: moving-left states, moving-right states and (may be) 
non-moving states. Then, after each step we observe whether the automaton is in accepting, 
rejecting, moving-right or moving-left state. If it is in a halting state, we terminate the 
computation. If it is in a moving-right state, we feed it the next letter (do the transformation 
on the quantum system corresponding to the next letter). If it is in a moving- left state, we 
feed it the previous letter. 



The model of section RTT] is a special case of this model where all non-halting states are 



classified as moving-right states. 



5.3 Preprocessing the input word 

In this model, we have two automata Mi and M 2 instead of one. Mi is a 2-way deterministic 
(or probabilistic) finite automaton with output and M 2 is a 1-way QFA. The input word is 
given to Mi and M 2 is run on the output of Mi. (This can be viewed as Mi preprocessing 
the input word.) Again, the model of section [5J] can be viewed as a special case of this 
model where Mi moves from left to right all the time and outputs all letters that it reads. 

Any regular language can be recognized in a trivial way because we can recognize it by 
Mi and give the result as an input to M 2 . If the preprocessing is done by a probabilistic 
automaton, we can do more. 

Theorem 14 For any e > 0, there is a 2-way probabilistic finite automaton Mi and a 1-way 
QFA M 2 such that, with probability at least 1 — e, 

1. Mi stops in time quadratic in the length of the input and, 

2. M 2 accepts the output of Mi if and only if x G {a n b n \n G IN}. 

Any 2-way probabilistic automaton that recognizes a non-regular language has an expo- 



nential expected running timeg, y, [8|, [TT|. So, neither polynomial time 2-way probabilistic 
finite automata nor 1-way QFAs can recognize non-regular languages. However, their com- 
bination can do that! 

Acknowledgments. We thank Isaac Chuang, Ashwin Nayak, Alistair Sinclair, Amnon 
Ta-Shma and Umesh Vazirani for useful discussions. 



References 



A. Ambainis, The complexity of probabilistic versus deterministic finite automata. Pro- 
ceedings of the International Symposium on Algorithms and Computation(ISAAC'96), 
Lecture Notes in Computer Science, 1178:233-239, 1996. 

A. Ambainis, A. Nayak, U. Vazirani, On the space- efficiency of 1-way quantum finite 
automata, LANL e-print |quant-ph / 9804043 . 



D. Angluin, Inference of reversible languages. Journal of the ACM, 29:741-765, 1982. 

E. Bernstein, U. Vazirani, Quantum complexity theory. SIAM Journal on Computing, 
26:1411-1473, 1997. 

CDwork and L.Stockmeyer. On the power of 2-way probabilistic finite state automata. 
Proceedings of the 30th IEEE Conference on Foundations of Computer Science(FOCS), 
1989, pp. 480-485. 

R.Freivalds. Probabilistic two-way machines. Lecture Notes in Computer Science, 
188:33-45, 1981 

R. Freivalds, On the growth of the number of states in result of determinization of 
probabilistic finite automata. Automatic Control and Computer Sciences, 1982, no. 3, 
39-42 

A.G.Greenberg and A.Weiss. A lower bound for probabilistic algorithms for finite state 
machines. Journal of Computer and System Sciences, 33:88-105, 1986. 

J. Hopcroft, J. Ullman, An Introduction to Automata Theory, Languages and Compu- 
tation. Addison- Wesley, 1979. 

J. Kemeny, J. L. Snell. Finite Markov chains. Van Nostrand, Princeton, N.J. , 1960 

J. Kaneps and R. Freivalds. Running time to recognize nonregular languages by 2-way 
probabilistic automata. Lecture Notes in Computer Science, 510:174-185, 1991. 

A. Kondacs, J. Watrous, On the power of quantum finite state automata. Proceedings 
of the 38th IEEE Conference on Foundations of Computer Science, 66-75, 1997. 

C. Moore, J. Crutchfield. Quantum automata and quantum grammars. Santa-Fe Insti- 
tute Working Paper 97-07-062, 1997. Also |quant-ph/ 9 7070311 . 



J. Pin. On reversible automata. Proceedings of Latin American Symposium on Theoret- 
ical Informatics (LATIN 7 92), Lecture Notes in Computer Science, 583:401-415, 1992. 



